BLEU Evaluation of Machine-Translated English-Croatian Legislation

نویسندگان

  • Sanja Seljan
  • Marija Brkic Bakaric
  • Tomislav Vicic
چکیده

This paper presents work on the evaluation of online available machine translation (MT) service, i.e. Google Translate, for English-Croatian language pair in the domain of legislation. The total set of 200 sentences, for which three reference translations are provided, is divided into short and long sentences. Human evaluation is performed by native speakers, using the criteria of adequacy and fluency. For measuring the reliability of agreement among raters, Fleiss' kappa metric is used. Human evaluation is enriched by error analysis, in order to examine the influence of error types on fluency and adequacy, and to use it in further research. Translation errors are divided into several categories: non-translated words, word omissions, unnecessarily translated words, morphological errors, lexical errors, syntactic errors and incorrect punctuation. The automatic evaluation metric BLEU is calculated with regard to a single and multiple reference translations. System level Pearson’s correlation between BLEU scores based on a single and multiple reference translations is given, as well as correlation between short and long sentences BLEU scores, and correlation between the criteria of fluency and adequacy and each error category.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic and Human Evaluation on English-Croatian Legislative Test Set

This paper presents work on the manual and automatic evaluation of the online available machine translation (MT) service Google Translate, for the English-Croatian language pair in legislation and general domains. The experimental study is conducted on the test set of 200 sentences in total. Human evaluation is performed by native speakers, using the criteria of fluency and adequacy, and it is ...

متن کامل

Evaluating Statistical Machine Translation from English to Dutch

In this paper, I attempt to evaluate the effectiveness of using statistical machine translation to translate an English text into Dutch, using empirical evaluation and the Bleu evaluation metric. I also give a brief overview of the theory behind statistical machine translation and automated translation evaluation metrics. I have translated a sample of the English proceedings of the European Par...

متن کامل

Domain Adaptation of Statistical Machine Translation using Web-Crawled Resources: A Case Study

In this research, we tackle the problem of domain adaptation of Statistical Machine Translation by exploiting domainspecific data acquired by domain-focused web-crawling. We design and empirically evaluate a procedure for automatic acquisition of both monolingual and parallel data and their exploitation for system training, tuning, and testing in a phrase-based Statistical Machine Translation f...

متن کامل

Statistical pattern-based machine translation with statistical French-English machine translation

We developed a two-stage machine translation (MT) system. The first stage consists of an automatically created pattern-based machine translation system, and the second stage consists of a standard statistical machine translation (SMT) system. For French-English machine translation, we first used a French-English pattern-based MT, and we obtained ”English” sentences from French sentences. Second...

متن کامل

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012